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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(1) APPLICANT: Matalon, Reuben 
Kaul , Rajinder 
Gao, Guang Ping 
Balamurugan, Kuppareddi 
Michals-Ratalon, Kimberlee 

(ii) TITLE OF INVENTION: Aspartoacylase Gene, Protein, and 

Methods of Screening for Mutations Associated with Canavan 
Disease 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: . _ 

(A) ADDRESSEE: Mill en. White, Zelano & Bramgan P.C. 

(B) STREET: 2200 Clarendon Boulevard, Suite 1400 

(C) CITY: Arlington 

CD.) .STATE : . Vi rgini a. ..• „ ,~. _ 

. 1XE~) -COUNTRY': .U.S. A. l^Z^^l^^S..^ 

r.;: :: , A ._ •; :{|Du:gIP: "2220i , . . 

(v) COMPUTER READABLE FORM: -■ — • 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS . 

(D) SOFTWARE: Patentln Release #1.0. Version #1.25 

(vi) CURRENT APPLICATION DATA: non 
(A) APPLICATION NUMBER: US 08/128,020 
1 (B) FILING DATE: 29-SEP-1993 

: (O CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Hamlet -King, Diana 

(B) REGISTRATION NUMBER: 33.302 

(C) REFERENCE/ DOCKET NUMBER: Shutt 1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 703-243-6333 

(B) TELEFAX: 703-243-6410 

(C) TELEX: 64191 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1435 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 159. .1097 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TTGTAACAGA AAATTAAAAT ATACTCCACT CAAGGGAATT CTGTACTTTG CCCTTTTGGT 60 
AAAGTCTCAT TTACATTTCT AAACCTTTCT TAAGAAAATC GAATTTCCTT TGATCTCTCT 120 
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TCTGAATTGC AGAAATCAGA TAAAAACTAC TTGGTGAA ATG ACT TCT TGT CAC 173 

Met ihr Ser Cys His 
1 5 

ATT GCT GAA GAA CAT ATA CAA AAG GTT GCT ATC TTT GGA GGA ACC CAT 221 
He Ala Glu Glu His He Gin Lys Val Ala He Phe Gly Gly Thr His 
10 15 20 

GGG AAT GAG CTA ACC GGA GTA TTT CTG GTT AAG CAT TGG CTA GAG AAT 269 
Gly Asn Glu Leu Thr Gly Val Phe Leu Val Lys His Trp Leu Glu Asn 
.25 30 35 

C-GC GCT GAG ATT CAG AGA ACA GGG CTG GAG GTA AAA CCA TTT ATT ACT 317 
Gly Ala Glu He Gin Arg Thr Gly Leu Glu Val Lys Pro Phe lie Thr 
40 45 50 

AAC CCC AGA GCA GTG AAG AAG TGT ACC AGA TAT ATT GAC TGT GAC CTG 365 
Asn Pro Arg Ala Val Lys Lys Cys Thr Arg Tyr lie Asp Cys Asp Leu 
: 55 60 ■ 65 

AAT CGC ATT TTT GAC CTT GAA AAT CTT GGC AAA AAA ATG TCA GAA GAT 413 
Asn Arg. He Phe Asp leu Glu Asn Leu Gly Lys Lys Met Ser Glu Asp 

,7G- -~ . . . 75,....,.;;; . .. ,: .so.;. .. 8b 

TTG CCA TAT GAA GTG AGA AGG GCT CAA GAA ATA AAT CAT TTA TTT GGT 461 
Leu Pro Tyr Glu Val" Arg Arg "Ala Gin Glu He Asn His Leu Phe Gly 
90 95 100 

CCA AAA GAC AGT GAA GAT TCC TAT GAC ATT ATT TTT GAC CTT CAC AAC 509 
Pro Lys Asp Ser Glu Asp Ser Tyr Asp He He Phe Asp Leu His Asn 
105 110 115 

. ACC ACC TCT AAC , ATG GGG TGC- ACT CTT ATT CTT GAG GAT TCC AGG AAT 557 
Thr Thr Ser Asn Met Gly- Cys Thr Leu He Leu Glu Asp Ser Arg Asn 
: C . 120 - • - - 125 ■ 130 

AAC TTT TTA ATT CAG ATG TTT CAT TAC ATT AAG ACT TCT CTG GCT CCA 605 
Asn Phe Leu He Gin Met Phe His Tyr He Lys Thr Ser Leu Ala Pro 
135 140 145 

CTA CCC TGC TAC GTT TAT CTG ATT GAG CAT CCT TCC CTC AAA TAT GCG 653 
Leu Pro Cys Tyr Val Tyr Leu lie Glu His Pro Ser Leu Lys Tyr Ala 
150 155 160 165 

ACC ACT CGT TCC ATA GCC AAG TAT CCT GTG GGT ATA GAA GTT GGT CCT 701 
Thr Thr Arg Ser lie Ala Lys Tyr Pro Val Gly lie Glu Val Gly Pro 
170 175 180 

CAG CCT CAA GGG GTT CTG AGA GCT GAT ATC TTG GAT CAA ATG AGA AAA 749 
Gin Pro Gin Gly Val Leu Arg Ala Asp He Leu Asp Gin Met Arg Lys 
185 190 ■ 195 

ATG ATT AAA CAT GCT CTT GAT TTT ATA CAT CAT TTC AAT GAA GGA AAA 797 
Met He Lys His Ala Leu Asp Phe He His His Phe Asn Glu Gly Lys 
200 205 210 



GAA TTT CCT CCC TGC GCC ATT GAG GTC TAT AAA ATT ATA GAG AAA GTT 845 
Glu Phe Pro Pro Cys Ala He Glu Val Tyr Lys He lie Glu Lys Val 
215 220 225 
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GAT TAC CCC CGG GAT GAA AAT GGA GAA ATT GCT GCT ATC ATC CAT CCT 
Asp Tyr Pro Arg Asp Glu Asn Gly Glu He Ala Ala lie He His Pro 
230 235 . 240 245 

AAT CTG CAG GAT CAA GAC TGG AAA CCA CTG CAT CCT GGG GAT CCC ATG 
Asn Leu Gin Asp Gin Asp Trp Lys Pro Leu His Pro Gly Asp Pro Met 
250 255 260 

TTT TTA ACT CTT GAT GGG AAG ACG ATC CCA CTG GGC GGA GAC TGT ACC 
Phe Leu Thr Leu Asp Gly Lys Thr He Pro Leu Gly Gly Asp Cys Thr 
265 270 - 275 

GTG TAC CCC GTG TTT GTG AAT GAG GCC GCA TAT TAC GAA AAG AAA GAA 
Val Tyr Pro Val Phe Val Asn Glu Ala Ala Tyr Tyr Glu Lys Lys Glu 
280 285 290 

GCT TTT GCA .AAG AGA ACT AAA CTA ACG CTC AAT GCA AAA AGT ATT CGC 
Ala Phe Ala Lys Thr Thr L^s Leu Thr Leu Asn Ala Lys Ser He Arg 



295 



305 



TGC TGT TTA- CAT TAGAAATCAC TTGCAGCTTA CATCTTACAC GGTGTCTTAC 
Cys Cys Leu His'- ' - 

3ip _ .. ... ;: , . - ; ' - 

AMTTCTGCT/: AGTCTGJ^G = CTCGTTAAGA-GTAGGGTTGT,GCCTTATTCA ACTGCATACA 
TAGCTCCTAG CACAGTGCCT TATTCGGTAG GCATCTAAGC AAATTTCTTA AATTAATTAA 
TATATCTTTA AAGATATCAT ATTTTATGTA TGTAGCTTAT TCAAAGAAGT GTTTCCTATT 
TCTATATAGT TTATTATACA TGATACTTGG GTAGCTCAAC -ATTCTTAATA AACAGCCTTT 
GTATTCAGAA JATAAAATTG AAATAGATAT -ATATAAAGTT AAAAAAAAAA AAAAAAAA 



893 

941 

989 

1037 

1085 

1137 

1197 
1257 
1317 
1377 
1435 



.(2) INFORMATION FOR SEQ ID NO:2: 

Ci) SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 313 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 83 

(D) OTHER INFORMATION: /note- 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 105 

(D) OTHER INFORMATION: /note= 

(ix) FEATURE: 

(A) NAME/ KEY: Modi fied-site 

(B) LOCATION: 108 

(D) OTHER INFORMATION: /note- 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 146 

(D) OTHER INFORMATION: /note- 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site. 



"Phosphorylation site' 



'Phosphorylation site" 



'Phosphorylation site" 



"Phosphorylation site" 
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(B) LOCATION: 264 

(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modi tied -site 

(B) LOCATION: 117 

(D) OTHER INFORMATION: /note- "Potential N-glycosylation 
site" 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 18.. 24 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis" 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 275. .'278" 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be.- involved in catalysis" 

(ix) FEATURE: . 

(A) NAME/KEYc Active-site 
. - . (B) LOCATION: 28.3 ...289 

[':.-■ --(D) OTHER-' INFORMATIONS /note- '"Consensus sequence 
■ f predicted to be involved in catalysis" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Thr Ser Cys His He Ala Glu Glu His He Gin Lys Val Ala He 
15 10 15 

-■Phe Sly Gly Thr His Gly Asn Glu Leu Thr Gly Val Phe Leu Val Lys 
20 - ■ ^ 25 30 

.-'His-Trp Leu Glu Asn Gly Ala Glu He Gin Arg Thr Gly Leu Glu Val 
. 35 40 45 

Lys Pro Phe lie Thr Asn Pro Arg Ala Val Lys Lys Cys Thr Arg Tyr 
50 55 60 

He Asp Cys Asp Leu Asn Arg He Phe Asp Leu Glu Asn Leu Gly Lys 
65 70 75 80 

Lys Met Ser Glu Asp Leu Pro Tyr Glu Val Arg Arg Ala Gin Glu He 
85 90 95 

Asn His Leu Phe Gly Pro Lys Asp Ser Glu Asp Ser Tyr Asp He He 
100 105 110 

Phe Asp Leu His Asn Thr Thr Ser Asn Met Gly Cys Thr Leu He Leu 
115 120 125 

Glu Asp Ser Arg Asn Asn Phe Leu He Gin Met Phe His Tyr He Lys 
130 135 140 

Thr Ser Leu Ala Pro Leu Pro Cys Tyr Val Tyr Leu lie Glu His Pro 
145 150 155 160 

Ser Leu Lys Tyr Ala Thr Thr Arg Ser He Ala Lys Tyr Pro Val Gly 
165 170 175 

lie Glu Val Gly Pro Gin Pro Gin Gly Val Leu Arg Ala Asd He Leu 
180 185 IT 
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Asp Gin Met Arg 
. 195 


Lys 


Met 


lie 


Lys 
2D0 


His Ala 


i a r*\ f TT ii* II* 

Leu Asp Phe He His His 
205 


Phe Asn Glu Gly 
210 


Lys 


Glu 


Phe 
215 


Pro 


Pro Cys 


Ala He Glu Val Tyr Lys 
220 


He He Glu Lys 
225 


Val 


Asp 
230 


Tyr 


Pro 


Arg Asp 


Glu Asn Gly Glu He Ala 
235 240 


Ala He He His 


Pro 
245' 


Asn 


Leu 


Gin 


Asp Gin 
250 


Asp i rp lys pro Leu his 
255 


Pro Gly Asp Pro 
260 


Met 


Phe 


Leu 


Thr 


Leu Asp 
255 


Gly Lys Thr He Pro Leu 
270 


Gly Gly Asg Cys 


Thr 


Val 


Tyr 


Pro 
280 


Val Phe 


Val Asn Glu Ala Ala Tyr 
285 


Tyr Glu Lys' Lys 
290 .... ; 


Glu 


Ala 


Phe 
295 


Ala 


Lys Thr 


Thr Lys Leu Thr Leu Asn 
• 300 


Ala -Lys Ser He 
305. . .. 


Arg 


Cys 
310 


Cys 


Leu 


His 





( 2 )' I NFORMATION" FOR" SEQ ID NO: 3: - 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

„.., . (ix) FEATURE: ' 

(A) NAME/ KEY: Region 

(B) LOCATION: 6 mti _. ...... 

(D) OTHER INFORMATION: /note= This is isoleucine in 

human, valine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/ KEY: Region 

(B) LOCATION: 9 . nTU . . n + . 

(D) OTHER INFORMATION: /note- This is glutamic acid m 
human, aspartic acid in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 10 . . . , 

(D) OTHER INFORMATION: /note- This is histidine in human, 
proline in bovine. This is a conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /note- "This is glutamine in human, 
lysine in bovine. This is a very conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME /KEY: Region 

(B) LOCATION: 38 



25vi 

(D) OTHER INFORMATION: /note= "This is glycine in human, 
serine in bovine. This is a very conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 39 n . . . 

(D) OTHER INFORMATION: /note= "This is alanine in human, 
threonine in bovine. This is a very conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 72 . . . . . 
(D) OTHER INFORMATION: /note= This is isoleucine in 

human, valine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 75 

(D) OTHER INFORMATION: /note- This is leucine in human, 
" :' " proline in bovine. This is not a conservative 

substitution. " • 

(ix) FEATURE: - 

(A) NAME/KEY: Region 

(B) LOCATION: 82 J _ u . . ~ . 
(D) OTHER INFORMATION: /note= This is methionine in 

human, lysine in bovine. This is a conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 134 . utu . . 

(D) OTHER INFORMATION: /note- This is asparagine in 
human, aspartic acid in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 200 .... . , 

(D) OTHER INFORMATION: /note- This is lysine in human, 
glutamine in bovine. This is a very conservative 
substitution . " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 208 , . . u . . , 

(D) OTHER INFORMATION: /note- This is histidme in human, 
asparagine in bovine. This is a very conservative 
substitution. " 



(ix) FEATURE: 

(A) NAME/ KEY: Region 

(B) LOCATION: 226 , . . 
(D) OTHER INFORMATION: /note- "This is isoleucme in 

human, methionine in bovine. This is a very 
conservative substitution." 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 227 



25vii 

(D) OTHER INFORMATION: /note= "This_is glutamic acid in 
human, arginine in bovine. Ihis is not a 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 234 . A . 
(D) OTHER INFORMATION: /note- "This is aspartic acid in 

human, asparagine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 236 . . 

(D) OTHER INFORMATION: /note= This is asparagine in 
human, serine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 240 IT1 . . , . . . 

-(D)- OTHER INFORMATION: /note- This is alanine m human, 
serine in bovine. :.This is -a very conservative 
. ... ■ . substitution. " ' 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 246 , . 

(D) OTHER INFORMATION: /note- This is asparagine in 
human, lysine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE : 

• (A) NAME/KEY: Region 

(B) LOCATION: 258 • „ TU . . , . . . 

(D) OTHER INFORMATION: /note- This is glycine in human, 
glutamic acid in bovine. This is a conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 261 u 

(D) OTHER INFORMATION: /note- "This is methionine in 
human, valine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 276 . . 

(0) OTHER INFORMATION: /note- "This is cysteine in human, 
glutamine in bovine. This is not a conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 306 _ n . . , 

(D) OTHER INFORMATION: /note- This is lysine in human,. 

asparagine in bovine. This is a very conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 310 
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(D) OTHER INFORMATION: /note= "This is cysteine in human. 

serine in bovine. This is a conservative substitution 

(ix) FEATURE: 

(A) NAME/ KEY: Region 

(B) LOCATION: 311 

(D) OTHER INFORMATION: /note- "This is cysteine in human. 

serine in bovine. This is a conservative substitution 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 83 

(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 105 

(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modi fi ed-site 

(B) LOCATION : 108 

(D) OTHER INFORMATION: ./note- -"Phosphorylation site;" . ■ 

(ix) FEATURE: " 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 146 ... 
(D) OTHER INFORMATION: /note- "Phosphorylation site' 

(ix) FEATURE: 

(A) NAME/ KEY: Modi fied-site 

(B) LOCATION: 264 n „ 
(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site - 

(B) LOCATION: 117 , „ n n . . 

(D) OTHER INFORMATION: /note- "Potential N-glycosylation site 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 18.. 24 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis" 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 275.. 278 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis" 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 283.. 289 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Thr Ser Cys His Xaa Ala Glu Xaa Xaa He Xaa Lys Val Ala He 
15 10 15 

Phe Gly Gly Thr His Gly Asn Glu Leu Thr Gly Val Phe Leu Val Lys 
20 25 30 
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His Trp Leu Glu Asn Xaa Xaa Glu He Gin Arg Thr Gly Leu Glu Val 
35 40 45 

Lys Pro Phe He Thr Asn Pro Arg Ala Val Lys Lys Cys Thr Arg Tyr 
J 50 55 60 

He Asp Cys Asp Leu Asn Arg Xaa Phe Asp Xaa "Glu Asn Leu Gly Lys 
65-70 75 80 

Lys Xaa Ser Glu Asp Leu Pro Tyr Glu Val Arg Arg Ala Gin Glu He 
85 90 95 

Asn His Leu Phe Gly Pro Lys Asp Ser Glu Asp Ser Tyr Asp He He 
100 105 HO 

Phe Asp Leu His Asn Thr Thr Ser Asn Met Gly Cys Thr Leu He Leu 
115 120 125 

Glu Asp Ser Arg Asn Xaa Phe Leu He Gin Met Phe His Tyr He Lys 
130 135 140 

Thr Ser Leu Ala -Pro Leu Pro Cys Tyr Val Tyr Leu lie Glu His Pro 
145' - -150 = • ". ■' 155 ------ 16b 

" Ser -Leu Lys Tyr Ala Thr Thr Arg Ser lie Ala Lys Tyr Pro Val Gly 
"- 165 170 - i< o 

He Glu Val Gly Pro Gin Pro Gin Gly Val Leu Arg Ala Asp He Leu 
180 185 190 

Asp Gin Met Arg Lys Met He Xaa His Ala Leu Asp Phe He His Xaa 
195 200 . 205 

Phe Asn Glu Gly Lys Glu Phe Pro Pro Cys Ala He Glu Val Tyr Lys 
210 . 215 220 

He Xaa Xaa Lys Val Asp Tyr Pro Arg Xaa Glu Xaa Gly Glu He Xaa 
225 " 230 235 - • 240 

Ala He He His Pro Xaa Leu Gin Asp Gin Asp Trp Lys Pro Leu His 
245 250 255 

Pro Xaa Asp Pro Xaa Phe Leu Thr Leu Asp Gly Lys Thr lie Pro Leu 
260 265 270 

Gly Gly Asp Xaa Thr Val Tyr Pro Val Phe Val Asn Glu Ala Ala Tyr 
275 280 285 

Tyr Glu Lys Lys Glu Ala Phe Ala Lys Thr Thr Lys Leu Thr Leu Asn 
290 295 300 

Ala Xaa Ser lie Arg Xaa Xaa Leu His 
305 310 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 83 

(D) OTHER INFORMATION: /note- "Phophorylati on site 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 105 

(D) OTHER INFORMATION: /note- "Phosphorylation site 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 108 , „ nu u 

(D) OTHER INFORMATION: /note- "Phosphorylation site 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site " 

(B) LOCATION: 146 

(D) OTHER INFORMATION: /note- "Phosphorylation .site 

(ix) FEATURE: 

(A) NAME/KEY:. Modi fied-site / "- ... - 
--• - (B) LOCATION: '264 ..... . . , ^ ^ . * ., 

.(D) OTHER INFORMATION: /note; ."Phosphorylation site 

'(ix) FEATURE: " j 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 117 -, „ -, -■ * - 
(D) OTHER INFORMATION: /note- "Potential N-glycosylation 

site" 

(ix) FEATURE: - 

(A) NAME/KEY: Active-site 

(B) LOCATION: 18.. 24 ■ 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis 

(ix) FEATURE: 

(A) NAME/ KEY: Active-site 

(B) LOCATION: 275.. 278 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 283.. 289 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis' 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Thr Ser Cys His Val Ala Glu Asp Pro He Lys Lys Val Ala He 
15 10 15 

Phe Gly Gly Thr His Gly Asn Glu Leu Thr Gly Val Phe Leu Val Lys 
20 25 30 

His Trp Leu Glu Asn Ser Thr Glu He Gin Arg Thr Gly Leu Glu Val 
35 40 45 

Lys Pro Phe He Thr Asn Pro Arg Ala Val Lys Lys Cys Thr Arg Tyr 
50 55 60 
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He Asp Cys Asp Leu Asn Arg Val Phe Asp Pro Glu Asn Leu Gly Lys 
65 70 lo oU 

Lys Lys Ser Glu Asp Leu Pro Tyr Glu Val Arg Arg Ala Gin Glu He 
85 90 95 

Asn His Leu Phe Gly Pro Lys Asp Ser Glu Asp Ser Tyr Asp He He 
100 105 110 

Phe Asp Leu His Asn Thr Thr Ser Asn Met Gly Cys Thr Leu He Leu 
K 115 120 125 

Glu Asp Ser Arg Asn Asp Phe Leu He Gin Met Phe His Tyr He Lys 
130 135 140 

Thr Ser Leu Ala Pro Leu Pro Cys Tyr Val Tyr Leu He Glu His Pro 
_ 145 150 155 160 

Ser Leu Lys Tyr Ala Thr Thr Arg Ser He Ala Lys Tyr Pro Val Gly 
165 170 1/5 

■ He Glu Val Gly Pro Gin Pro Gin Gly Val Leu Arg Ala Asp He Leu 

. .. 180./' ' _ 185 ■ 190 

' "'Asp Gin Met Arq Lys" Met H elfin 'His Afa Leu Asp Phe He His Asn 
195 " 200 205 

Phe Asn Glu Gly Lys Glu Phe Pro Pro Cys Ala 'lie Glu Val Tyr Lys 
210 215 220 

He Met Arg Lys Val Asp Tyr Pro Arg Asn Glu Ser Gly Glu He Ser 
225 230 235 240 

" ' "Ala He He His Pro Lys Leu Gin Asp Gin Asp Trp.Lys Pro Leu His 

• ; ; - . 245 " '• ' '250 255 

Pro Glu Asp Pro Val Phe Leu Thr Leu Asp Gly Lys Thr lie Pro Leu 
260 265 270 

Gly Gly Asp Gin Thr Val Tyr Pro Val Phe Val Asn Glu Ala Ala Tyr 
275 280 285 

Tyr Glu Lys Lys Glu Ala Phe Ala Lys Thr Thr Lys Leu Thr Leu Asn 
J 290 295 300 

Ala Asn Ser He Arg Ser Ser Leu His 
305 310 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
AACCCTACTC TTAAGGAC 
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(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) ^STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modi fied_base 

(B) LOCATION: 1 , J u 

(D) OTHER INFORMATION: /mod base= OTHER , ++a ^ hoH 
/note= "The M13 universal primer tag is attached 
to base number 1 . " 

- (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCGGGATGAA AATGGAGAA 

(2) INFORMATION FOR SEQ ID NO: 7: 

Ci) SEQUENCE CHARACTER-TSTICS : 
CA).-4Efcl8TB.c--lR-b3se pairs 

(B) TYPE :- : " nucleic acid •- 

(C) STRAN13EDNES3: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: j? . J u 

(A) NAME/ KEY: modi fied_base 

(B) LOCATION: 1 , ^ 

. (0) OTHER INFORMATION: /mod base=. OTHER . 

/note=."The M13 reverse pn.mer tag is attached to 
. base 1.-" ~-' • " 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
ACCGTGTAAG ATGTAAGC 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
AGGATCAAGA CTGGAAACC 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
GTAAGACACC GTGTAAGATG . 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

Ciii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10 

Gly Gly Thr His Gly Asn Glu 

1 , - . 5 

(2) INFORMATION FOR SEQ ID NO:ll: 

; . .(i) SEQUENCE CHARACTERISTICS: 
• ■ ! - ; '---CA.) -LENGTH: 7 amino acids 
. ." -(B) TYPE: amino acid 
.^{.t j^~^"D>^POLOGY: linear 

• 55:Qj4^HM^>^ICAL: NO - • - - - . 
(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

Val Asn Glu Ala Ala Tyr Tyr 

1 .5 

TZ). .INFORMATION FOR SEQ ID NO: 12: 

" : "(1 )' SEQUENCE "CHARACTER I ST ICS : 

(A) LENGiH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear • 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

Val Xaa Glu Xaa Xaa Xaa Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 

Leu Glu Asn Ser Thr Glu He Gin Arg Thr Gly Leu Glu Val Lys Pro 
15 10 15 

Phe He Thr Asn Pro Arg Ala Val Lys Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 14: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

■ ' (v) FRAGMENT TYPE: internal 

■ " - CxTfc' SEQUENCE DESCRI PTION : S:EQ i D NO : 14 : 

x^.-lrf'.fii-Pro-.Leu He 'Pro Xaa/iso Pro Val Phe Leu Thr Leu Asp Gly Lys 
■kj&\ tz uk-' '.. 5 10 15- 

^• i '#?tife-S"er Leu Gly' Gly Asp Gin Thr Xaa Tyr Pro Xaa Phe Xaa Asn 
20 25 30 

Glu Ala Ala Tyr Tyr 
35 

(2). INFORMATION FOR SEQ ID NO: 15: 

rV;.- SEQUENCE CHARACTERISTICS:' 
'■" - ' (A) LENGTH: 24 amino acids 
• : . _ (B) TYPE: amino acid . 
(DrTOPOLOGY: linear 

(iii)" HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Xaa Lys Val Asp Tyr Pro Arg Asn Glu Ser Gly Glu He Ser Ala He 
15 10 15 

He His Pro L^s Leu Gin Asp Gin 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) lYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Xaa Xaa Xaa Ala Leu Asp Phe He Xaa Asn Phe Xaa Glu Xaa Lys Glu 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

(ix-) FEATURE: 

-(A).. .'NAME/ KEY: modified base 
.(B) LOCATION: 15 
: . ' •'?_--([&: 0TH£R INFORMATION: /mod_base- i 

(i-x) FEATURE-: 

(A) NAME/KEY: modified base 

(B) LOCATION: 16 

. (0) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/ KEY: modified base 

(B) LOCATION: 18 

' "(D> OTHER INFORMATION: /mod_base= i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
; AARGTNGAYT AYCCNNGNAA - 20 

(2) INFORMATION FOR SEQ ID NO: 18: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY: modified base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= T 

(ix) FEATURE: 

(A) NAME/ KEY: modified base 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod base= i 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TGRTCYTGNA NYTTNGGRTG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCGTGTACCC AGTGTT 

(2) INFORMATION FOR SEQ ID NO: 20: 

• (i) 5EQUENCE CHARACTERISTICS: 

, . , (A) LENGTH: 20 base pairs 

• ■"]'"-••""•'• '(B) TYPE: nucleic acid 

(C) STRANDEDNtSS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20 
CTTCTGAATT GCAGAAATCA 

(2L INFORMATION FOR SEQ ID NO: 21: 

- - " (i )' SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

' (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 
GTAAGACACC GTGTAAGATG 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: J Jf . J . 

(A) NAME/ KEY: modified base 

(B) LOCATION: 6 , u 

(D) OTHER INFORMATION: /mod_base= 

(ix) FEATURE: j x . J u 

(A) NAME/ KEY: modified base 

(B) LOCATION: 9 ~, J u 

(D) OTHER INFORMATION: /mod_base= 

(ix) FEATURE: j-jr . 

(A) NAME/ KEY: modi fied_base. 
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(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod_base= 1 

(ix) FEATURE: . ^. J u 

(A) NAME/KEY: modified base 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /mod_base= i 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GGRTANACNG TYTGRTCNCC NCC 

(2) INFORMATION FOR SEQ ID N0:23: 

-■■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

" ' (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ix) FEATURE: - 

- (A) NAME/KEY: modified base 
; . .. (B) LOCATION: 3 ~ " u 

CJ^-- ' (D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: J _ J u 

(A) NAME/KEY: modified base 

(B) LOCATION: 6 ~ J u 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

'■■■■'■■i (A) NAME/KEY: modified base 

. -(B) LOCATION: 9 " J , 

' (D) OTHER INFORMATION: /mod_base= i 

: (ix) FEATURE: J . je , 

(A) NAME/KEY: modified base 

(B) LOCATION: 12 ~ j u 

(D) OTHER INFORMATION: /mod_base= i 

■ (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
CCNMGNGCNG TNAARAARTG 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 

Asp Cys Thr Val 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /note= "Consensus sequence of catalytic 
center in esterases" 

(ix) FEATURE: 

(A) NAME/ KEY: Region 

(B) LOCATION: 5 

(D) OTHER INFORMATION: /note= "Amino acid 5 is glycine or 
alanine" 

(ix) FEATURE: 

. -- - (A)- NAME/ KEY: Reqion - ... 

• - .-. (B) LOCATION: ? " " " . . ' 

. _(0) OTHER INFORMATION": /note- "Amino acid 7 is glutamic 
acid or aspartic acid" • - 3 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:25: 

Gly Xaa Xaa His Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR" SEQ ID N0:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 1..4 

(D) OTHER INFORMATION: /note= "Consensus sequence of catalytic 
center in esterases" 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 4 

(D) OTHER INFORMATION:' /note- "Amino acid 4 is phenylalanine or 
valine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

Asp Xaa Xaa Xaa 
1 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/ KEY: Active-site 

(B) LOCATION: 1..7 

(D) OTHER INFORMATION: /note= "Consensus sequence of catalytic 
center in esterases" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
"Val Xaa Glu Xaa Xaa Xaa Tyr 



